TRACDS: Temporal Relationship Among Clusters for Data Streams

نویسندگان

  • Michael Hahsler
  • Margaret H. Dunham
چکیده

In this paper we propose a new extension to clustering data streams based on the Temporal Relationship Among Clusters for Data Streams (TRACDS). This is not a new clustering algorithm, but rather a way to capture the temporal relationships among clusters that is inherent in the ordering of observations in the data stream. We propose to capture this ordering relationship among the clusters by overlaying clusters created by any data stream clustering algorithm with a Markov Chain (MC). The states in the Markov Chain represent the clusters and the transitions are the relationships between clusters. For the TRACDS framework we identify the basic clustering operations used by any state-of-the-art data stream clustering algorithms and show how these operations can be mirrored on the MC using a set of TRACDS operations. This approach makes TRACDS general enough to be built on top of any clustering algorithm. We evaluate the improvement of TRACDS over just data stream clustering for outlier detection using several synthetic data sets to simulate different types of temporal structure as well as different levels of data quality (i.e., data points arrive outof-order). and Retrieval Clustering

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Temporal Structure Learning for Clustering Massive Data Streams in Real-Time

This paper describes one of the first attempts to model the temporal structure of massive data streams in real-time using data stream clustering. Recently, many data stream clustering algorithms have been developed which efficiently find a partition of the data points in a data stream. However, these algorithms disregard the information represented by the temporal order of the data points in th...

متن کامل

Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain

This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...

متن کامل

Density Micro-Clustering Algorithms on Data Streams: A Review

Data streams are massive, fast-changing, and infinite. Applications of data streams can vary from critical scientific and astronomical applications to important business and financial ones. They need algorithms to make a single pass with limited time and memory. Mining data streams is concerned with extracting knowledge structures represented in models and patterns in non-stopping data streams....

متن کامل

SCUBA: Scalable Cluster-Based Algorithm for Evaluating Continuous Spatio-temporal Queries on Moving Objects

In this paper, we propose, SCUBA, a Scalable Cluster Based Algorithm for evaluating a large set of continuous queries over spatiotemporal data streams. The key idea of SCUBA is to group moving objects and queries based on common spatio-temporal properties at runtime into moving clusters to optimize query execution and thus facilitate scalability. SCUBA exploits shared cluster-based execution by...

متن کامل

A TMO-based approach to tolerance of transmission jitters in tele-audio services

One of the key issues in multimedia processing, especially, in distributed processing of multimedia data streams on a network of computing platforms, is still the accurate maintenance of the temporal relationship among data elements of various media types [Aya01, Bla96, Cel00, Her98, Hun01, Ste96, Bla96, KSK02]. Moreover, the amount of effort consumed by the designer and the programmer in the s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010